Speech segmentation without speech recognition
Abstract
In this paper, we present a semantic speech segmentation approach, specifically sentence segmentation, that does not rely on speech recognition. To obtain phoneme-level information without word recognition, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to cope with varying backgrounds and environments. Three feature sets, covering pause, rate of speech, and prosody, are used to identify sentence boundaries. Experiments on broadcast news indicate that the proposed algorithm performs satisfactorily.
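The abstract does not say how the adaptive pause detection works, so the following is only a minimal sketch of one common way to make a pause threshold adapt to the recording, not the authors' method. It assumes an energy-based detector whose threshold sits between an estimated noise floor and an estimated speech level; the function names and the parameters `ratio`, `min_frames`, `low_pct`, and `high_pct` are hypothetical.

```python
import math


def frame_energies(samples, frame_len):
    """Return the log-energy (dB) of each non-overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        # Small constant avoids log10(0) on digitally silent frames.
        energies.append(10.0 * math.log10(power + 1e-12))
    return energies


def detect_pauses(energies, ratio=0.25, min_frames=3, low_pct=0.1, high_pct=0.9):
    """Mark runs of low-energy frames as pauses (illustrative assumption).

    The threshold adapts to each recording: it is placed a fraction
    `ratio` of the way from the noise floor (a low percentile of the
    frame energies) up to the speech level (a high percentile).
    Returns a list of (first_frame, end_frame_exclusive) pairs.
    """
    if not energies:
        return []
    ranked = sorted(energies)
    floor = ranked[int(low_pct * (len(ranked) - 1))]
    peak = ranked[int(high_pct * (len(ranked) - 1))]
    threshold = floor + ratio * (peak - floor)

    pauses = []
    run_start = None
    # The infinite sentinel closes a pause that runs to the end of the signal.
    for i, energy in enumerate(energies + [float("inf")]):
        if energy < threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            if i - run_start >= min_frames:
                pauses.append((run_start, i))
            run_start = None
    return pauses
```

Calling `detect_pauses(frame_energies(samples, frame_len))` yields frame-index ranges that could feed a pause-duration feature. Because the threshold is derived from the recording's own energy distribution, a signal with uniform energy produces no pauses rather than being flagged entirely.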
Similar resources
Comparison of different machine learning methods for extractive Persian speech-to-speech summarization without using transcripts
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files more easily and at lower cost. In this paper, a new method for speech summarization without using automatic speech recognitio...
Wavelet Transforms for Non-Uniform Speech Recognition Systems
A new algorithm for non-uniform speech segmentation and its application in speech recognition systems is presented. A new method based on the Modulated Gaussian Wavelet Transform based Speech Analyser (MGWTSA) and the subsequent Parametrization block is used to transform a uniformly signal into a set of non-uniformly separated frames, with the accurate information to be fed to our speech recogn...
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Publication date: 2003